CRAFT: ClusteR-specific Assorted Feature selecTion
Authors
Abstract
We present a framework for clustering with cluster-specific feature selection. The framework, CRAFT, is derived from asymptotic log posterior formulations of nonparametric MAP-based clustering models. CRAFT handles assorted data, i.e., both numeric and categorical data, and the underlying objective functions are intuitively appealing. The resulting algorithm is simple to implement and scales nicely, requires minimal parameter tuning, obviates the need to specify the number of clusters a priori, and compares favorably with other methods on real datasets.
Related resources
An Evaluation of Feature Selection Methods for Multiclass Learning in Bio Informatics
Traditional data mining techniques such as classification or clustering have demonstrated achievement in datasets which has multiple instances in singly relation but while extreme point of dimensionality or complex dependencies presents in the data it fails to offer accuracy and correctness. In solution to this, Feature (attribute/variable) selection techniques since last two decades have verif...
A survey of variable selection methods and multiclass learning in bio informatics
Feature selection based data mining methods is one of the most important research directions in the fields of machine learning in recent years. This paper presents a review of assorted feature selection methods named filter, wrapper and embedded and multiclass classifiers like support vector machines (SVM), decision tree, averaged perceptron and neural network. Additionally it conveys an assess...
Clustering Complex Data with Group-Dependent Feature Selection
We describe a clustering approach with the emphasis on detecting coherent structures in a complex dataset, and illustrate its effectiveness with computer vision applications. By complex data, we mean that the attribute variations among the data are too extensive such that clustering based on a single feature representation/descriptor is insufficient to faithfully divide the data into meaningful...
Unsupervised Personalized Feature Selection
Feature selection is effective in preparing high-dimensional data for a variety of learning tasks such as classification, clustering and anomaly detection. A vast majority of existing feature selection methods assume that all instances share some common patterns manifested in a subset of shared features. However, this assumption is not necessarily true in many domains where data instances could...
Special Issue on Advances in Intelligent Systems
This special issue consists of five papers focused on recent developments in the field of Intelligent Systems. A worldwide recognized event in this field is the “International Conference on Intelligent Systems, Design and Applications” series, whose last year edition was held in Pisa, Italy, November 30 December 2, 2009. An assorted list of outstanding contributions to that conference was selec...
Publication date: 2016